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The aim of this note is to bring to the attention of a wide mathematical au- 
dience the recent apphcation of division algebras to wireless communication. The 
application occurs in the context of communication involving multiple transmit and 
receive antennas, a context known in engineering as MIMO, short for multiple in- 
put, multiple output. While the use of multiple receive antennas goes back to the 
time of Marconi, the basic theoretical framework for communication using multiple 
transmit antennas was only published about ten years ago. The progress in the 
field has been quite rapid, however, and MIMO communication is widely credited 
with being one of the key emerging areas in telecommunication. Our focus here 
will be on one aspect of this subject: the formatting of transmit information for 
optimum reliability. 

Recall that a division algebra is an (associative) algebra with a multiplicative 
identity in which every nonzero element is invertible. The center of a division 
algebra is the set of elements in the algebra that commute with every other element 
in the algebra; the center is itself just a commutative field, and the division algebra 
is naturally a vector space over its center. We consider only division algebras 
that are finite-dimensional as such vector spaces. Commutative fields are trivial 
examples of these division algebras, but they are by no means the only ones: for 
instance, class-field theory tells us that over any algebraic number field there 
is a rich supply of noncommutative division algebras whose center is K and are 
finite-dimensional over K . 

Interest in MIMO communication began with the papers [201 E ESI HQ] where it 
was established that MIMO wireless transmission could be used both to decrease 
the probability of error as well as to increase the amount of information that can be 
transmitted. This caught the attention of telecommunication operators, particu- 
larly since MIMO communication does not require additional resources in the form 
of either a larger slice of the radio spectrum or else increased transmitted power. 

The basic setup is as follows: Complex numbers re**^, encoded as the amplitude 
(r) and phase (^) of a radio wave, are sent from t transmit antennas (one number 
from each antenna), and the encoded signals are then received by r receive antennas. 
The presence of obstacles in the environment such as buildings causes attenuation 
of the signals; in addition, the signals are refiected several times and interfere with 
one another. The combined degradation of the signals is commonly referred to 
as "fading" , and achieving reliable communication in the presence of fading has 
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been the most challenging aspect of wireless communication. The received and 
transmitted signals are modeled by the relation 

where X is a t x 1 vector of transmitted signals, F is an r x 1 vector of received 
signals, is an r x 1 vector of additive noise, H is a.nr x t matrix that models the 
fading, and is a real number chosen to normalize the transmitted signals so as to 
fit the available power. Under the most commonly adopted model, the entries of 
the noise vector W and the channel matrix H are assumed to be Gaussian complex 
random variables that are independent and identically distributed with zero mean. 
(A Gaussian complex random variable is one of the form w — x + ly where x 
and y are real Gaussian random variables that are independent and have the same 
mean and variance. The modulus of such a random variable, and in particular the 
magnitude of of each fading coefficient hij , is then Rayleigh distributed. This model 
is hence also known as the Rayleigh fading channel model.) It is the presence of 
fading in the channel that distinguishes this model from more classical channels, 
where the primary source of disturbance is the additive Gaussian noise W. 

The transmission typically occurs in blocks of length n: each antenna transmits 
n times, and the receiver waits to receive all n transmission before processing them. 
A common engineering model is to assume that r — t ^ n, so the equation above 
is accordingly modified to read 

(1) Ynxn — SHnxn^nxn + Wnxn- 

Thus, the ith column of F, X, and W represent (respectively) the received vec- 
tors, the transmitted information, and the additive noise from the ith transmission. 
In the model considered here, the fading characteristics of the channel (i.e., the hij) 
are assumed to be known to the receiver, but not to the transmitter (this is known 
as coherent transmission). A measure of the power available during a single trans- 
mission from all n antennas, i.e., a single use of the telecommunication channel, 
is the signal-to-noise ratio (SNR) p. Recall that the Frobcnius norm ||Ar||i? of 

X — {xij ) equals ^X^ij- Ni.iP- Since the power required to send a complex num- 
ber varies as the square of its modulus, the normalization constant 9 must satisfy 
6*^11^111 < np. 

A subset S of the nonzero complex numbers known as the signal set is selected as 
the alphabet (a common situation is that S" is a finite subset of size q of the nonzero 
Gaussian integers — {0}), and a fc-tuple (si, S2, ■ • ■ , Sfe), Si G S, comprises the 
message that the transmitter wishes to convey to the receiver. Thus there are 
q'' messages in all and it is assumed that each message is equally likely to be 
transmitted. A space-time code is then a one-to-one map X : S'^ Af„(C); we 
write X for X{S''). The transmitted matrix OXnxn in Equation ([T]) is thus drawn 
from the set OX as (si, S2, . . . , s^) vary in . Often X itself is referred to as the 
space time code. It is typically assumed that the map X is "linear in S*^" , that is, 
it is the restriction to of a group homomorphism {S)^ Mn{C), where (S) is 
the additive subgroup of C generated by S. 

Under the information-theoretic framework developed by Shannon in 1948 ([17]) 
and adopted ever since within the telecommunication community, the amount of 
information conveyed by a message in this setting is equal to log2{q'') "bits". Since 
this amount of information is conveyed in n transmissions over the MIMO channel. 
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the rate of information transmission is then given by log2(g) bits per channel use. 
When q and n are fixed a priori, the quantity k serves as a measure of information 
rate. 

RehabiUty of communication is commonly measured by the probability Pg of 
incorrectly decoding the transmitted message at the receiver. The pairwise error 
probability Pe{i,j) is the probability that message i is transmitted and message j is 
decoded. Performance analysis of MIMO communication systems typically focuses 
on the pairwise error probability as it is easier to estimate and also because the 
error probability can be upper and lower bounded in terms of the pairwise error 
probability. 

It was shown in [501 [TU] that for a fixed SNR (i.e., power) p, in order to keep the 
pairwise error probability low, the space time code X must meet the two criteria 
below, of which the first is primary: 

(1) Rank Criterion: For (si, S2, . . . , Sfe) 7^ (s'j^, Sg, . . . , s^.), X{si, S2, ■ ■ ■ , Sk) — 
X{s'i, s'2, . . . , s^)) must have full rank n, i.e., it must be invertible. 

(2) Coding Gain Criterion: For (si, S2, . . . , Sk) 7^ {s'l, s'2, . ■ ■ , sj.), the modulus 
of the determinant of difference | det{9X{si, S2, . . . , Sk)—dX{s'i,s'2, ■ ■ ■ , s'j,))| 
must be as large as possible. 

Clearly, the second criterion comes into play only when the first criterion has been 
met. Note that one cannot arbitrarily scale the matrices X to increase the coding 
gain because the assumption of fixed p along with the relation 6*^1 |X| ||, < np would 
cause a corresponding decrease in 6. Note too that the second criterion shows that 
one cannot increase the quantity k (a proxy for the rate of information) arbitrarily, 
as this would create a larger set of matrices 9X all circumscribed to lie within a 
sphere of radius y/np, which would entail that the determinant of their differences 
would get smaller, thereby increasing the pairwise error probability. 

Satisfying the Rank Criterion 

The earliest space-time code, for two antennas, was given by an engineer Alam- 
outi ([I]): given an arbitrary signal set S, he chose X: 5^ — > M2(C) to be 

(2) X{s^,.S2)^( ^1 ^ ) 

\ 32 Sl J 

(where stands for complex conjugation). It is easy to see that the rank criterion 
is immediately met. Writing si = mi + iU2, S2 = U3 + lUi, each such matrix can 
be expressed in the form X{si, S2) = f^i^i- The 2x2 complex matrices Aj 

are such that for any complex 2x2 channel matrix H, the collection of 2 x 2 
matrices {HAi} is pairwise orthogonal when regarded as vectors in by writing 
out sequentially the real and imaginary parts of each entry of the {HAi}. The 
expansion above makes it possible to do a least squares estimation of the Uj from 
the received matrix Y, also considered as a vector in as above, by projecting 
onto the respective matrices HAj (we will consider this in more detail later). It is 
this property that makes the Alamouti code so easy to decode, and not surprisingly, 
the code has since been adopted into the IEEE 802.1 In "Wireless LAN" standard. 
In applications, the {si, S2} are typically drawn from a subset of Z[«] x Z[z]. 

Alamouti's code led to a furious search among engineers and coding theorists 
for generalizations for higher number of antennas. Much of the early work (see 
[21] for example) focused on combinatorial methods. The matrix X in Equation 
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([2]) is almost unitary: it satisfies XX'^ = (sisT+ 3282)12, where the superscript f 
stands for transpose conjugate, and I2 stands for the 2x2 identity matrix. Not 
surprisingly, early workers (see '21] for example) sought nxn matrices X{si, . . . , Sk) 
whose entries come from the set {±Sj, ±sj, ±iSj, ±iSj, j = 1, . . . , k} and satisfy 

(3) = (sisT+--- + Sfesr)/„ 

This quickly leads to a necessary condition: the existence of 2fc — 1 complex nxn 
matrices Ai satisfying AlAi = /„, A J = —Ai, and AiAj = —AjAi for 1 < i < j < 
2fc — 1. These are of course the Hurwitz-Radon-Eckmann matrices, and classical 
results of Hurwitz-Radon-Eckmann (see [5] for instance) severely limits the values 
of k for which such matrices can exists, li n = 2''(2&-f 1) then the Hurwitz-Radon- 
Eckmann result says that the maximum possible value of k equals (a + 1). Thus 
/c = n if and only if n = 2, fc < ^ for n > 2, and A: < ^ for n > 4. It follows that 
these generalizations of the Alamouti code transmit too few information symbols 
for more than two transmit antennas. (A similar analysis of the matrices Ai using 
representation of Clifford Algebras was made by Tirkkonen and Hottinenin 19J.) 

In 2001, Sundar Rajan, a professor of communication engineering at the Indian 
Institute of Science introduced the problem of designing matrices X{si,...,Sk) 
satisfying the rank criterion to this author. Given his algebraic background, this 
author could recognize easily that matrices arising from embeddings of fields and 
division algebras can be utilized to solve this problem. Let f : D ^ M„ (C) be an 
embedding, i.e., an (injective) ring homormorphism of a division algebra D into the 
nxn matrices over C. Then for Xi = /(c?i) and X2 = f{d2) {Xi ^ X2), Xi — X2 
must necessarily be invertible. This is because di — being a nonzero element of 
the division algebra D, is automatically invertible, and since / is a homomorphism, 
the same must also be true of Xi — X2. Thus, the matrices in f{D) automatically 
satisfy the rank criterion. Using this observation, Sundar Rajan, his Ph.D. student 
Shashidhar, and this author ( 18 ) proposed several schemes for constructing space- 
time codes from various signal sets. For each signal set S and for each n, they 
constructed suitable division algebras D, suitable embeddings f : D ^ Af„(C), and 
suitable injective maps X : S'^ ^ f{D), for suitable k. 

For simplicity of construction in the noncommutative case, the authors of ( jl8p 
used cyclic division algebras for their codes. A cyclic division algebra is constructed 
from two data: a field extension K/F of degree n that is Galois with cyclic Galois 
group (a), and a nonzero element j £ F that satisfies the property that for any 
i = 1, . . . , n — 1, 7' is not a norirQ from K to F. As a iiT- vector space, the algebra 
is expressible as 

n-l 

Z? = Ku' 

i=0 

where u is a symbol. The multiplication in this algebra is given by the relations 
uk = a{k)u for all k € K, and u" = 7. The bilinearity of multiplication along with 
these relations then allows us to determine the product of any two elements of D. 
One can prove that this construction indeed yields a division algebra with center 
F. (Such a division algebra is said to be of index n.) 



this is a sufficient condition to obtain a cyclic division algebra 
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There is a well-known embedding of such a D into Mn{K) that sends fco + km + 

■ ■ ■ kn-lU^^^ to 



(4) 



fco 

k2 
ks 


7Cr(fc„_i) 
a-(fco) 

^T(fcl) 
^(fc2) 


7Cr2(fc„_2) . 
7Cr2(fc„_;^) ^ 

<J^{ko) . 


7Cr"- 
7Cr"- 
7Cr"- 

7ct"- 


-^(fcl) 

-Hk2) 
-Hks) 
~\k,) 


n-2 
n — 1 


0-(fcri-3) 
0-(fc„-2) 


CT^(fc„_4) ■ 
CT^(fc„_3) ■ 


. 7Cr"-l 
cr"- 


{kn~l 

Hko) 



By taking F to be various subfields of C containing Q{S) (the field generated 
by the elements of S over Q) in this formulation, and for each such F taking 
various K and 7, a wide variety of space-time codes can be constructed for a wide 
range of signal sets. For further simplicity of construction, particularly in the 
selection of the element 7 above, the authors of ([H]) chose all their base fields 
F to contain transcendental elements; in most cases, their cyclic extensions K/F 
were of the form Ko{x)/ Fq{x), where Kq/Fq is a cyclic extension of number fields, 
and X is a transcendental. In these cases, the authors' construction yielded codes 
X: S*"' ^ M„(C), i.e., with fc = n^. 

Alamouti's original code above arises as a special case of this formulation: the 
matrices of Equation ([2]) arc just the matrices of Equation (|4]) above specialized 
to the cyclic algebra (C/R, cr, — 1), where a stands for complex conjugation. This 
is nothing other than Hamilton's quaternions: the four-dimensional R algebra K ® 
Ri © Rj ® Rfc subject to the relations i"^ = = —1, ij = —ji ~ fc. (The signal set 
in Alamouti's construction is contained in K instead of F, unless of course if S is 
real.) 

Satisfying the Coding Gain criterion 

The coding community immediately recognized the potential of cyclic division 
algebras as a fundamental tool for constructing space-time codes and began to 
work with the coding paradigm introduced in [TS]. However, there was still a 
drawback. While the specific codes of [THj certainly satisfied the rank criterion, their 
performance was not satisfactory. The reason for this became clear: the specific 
division algebras of |18| were proposed only for mathematical simplicity-merely as 
easy examples of the larger paradigm of division algebras-and were not optimized 
for the coding gain performance criterion above. The use of transcendental numbers 
in the codes in [TH] caused the determinants of the difference matrices to come 
arbitrarily close to zero and limited their performance. 

This situation was quickly remedied in [2] by a very clever technique. To provide 
a lower bound on the modulii of the determinants of the difference of code matrices, 
the authors Belfiore, Rekaya and Viterbo first constructed division algebras from 
cyclic extensions K/Q{i) and 7 G but then restricted the various ki in the 
matrix Q above to entries in Ok, the ring of integers of K. The net result, 
as can easily be seen, is that the determinant of the difference of any two such 
matrices will live in Z[z], and therefore will have modulus bounded below by 1. 
Moreover, this will be true no matter how large a subset of Z[i] is used as the signal 
set. They called this last property the "nonvanishing determinant property" and 
they called the specific code they proposed the Golden Code. It was so named for 
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the Golden Ratio that appears naturally: it is derived from the division algebra 
(Q(i, \/5)/Q(«), CT, i). Here, a is the automorphism of K — \/5) that sends -\/5 
to — \/5 and acts as the identity on Q(i). A Z-basis for Ok is given by 1 and 
= ii^. Write ^ for a{(j)) = For a signal set S C C Q(z) (the most 

common kind of signal set), this code sends 5"* to M„(C) via the matrix 

Here, the ^= scale factor, a = l + i{l — (f)), and 9 = (7{a) = 1 + t{l ~ tp) 
v5 

are used to shape the code (more on this ahead). Comparing with the matrix 
(HI above and ignoring the scale factor, we see that fco = so,ia + so,2«'/' and fci = 
si_ia+si^2Ct4>- Note that this code encodes four information symbols in each matrix. 
(A variant of this code, also based on the division algebra (Q(i, VE)/Q{i), cr, i), also 
incorporating the shaping criterion described ahead, is currently part of the IEEE 
802. 16e "WIMAX" standard. The Alamouti code based on the quaternions is also 
part of this standard.) 

With the introduction of cyclic division algebras as a fundamental construction 
paradigm and with the use of codes constructed with entries from Ok for suitable 
extensions of (Q(«), the subject of space-time coding took off. It is harmless and 
very often actually useful to assume that the signal set S is infinite: typically, 
S is assumed to be one of the standard lattices Z, Z[z] or the Eisenstein lattice 
Z,[lu], where lu stands for the primitive third root of unity — '-"y^— ^. (Under these 
assumptions the code forms an additive group, so one only needs to consider the 
rank of X{si, . . . , Sk) and the modulus of the determinant | det 9X{si, . . . , Sk)\ in 
the rank criterion and the coding gain criterion.) Coding theorists immediately 
looked for specific constructions of division algebras of the form {K/F, a, 7) for 
the cases where F = Q, F = Q{i), and F = Q(\/^), corresponding to signal 
sets equaling one of the three lattices above. While such constructions have been 
known in principle to mathematicians working with division algebras, the coding 
theorists absorbed the necessary number-theoretic background in very short order 
and explicitly constructed division algebras over such fields for all indices n ([T3] 
and [6]). (The hard task here is to select j £ Op so that it has the property that 7' 
is not a norm from K to F for i = 1, . . . , n — 1.) In all such cases, an Oj^-basis f3j of 
Ok is chosen, and each ki is written as J2^=i t;he signal set. Thus, 

elements from the signal set are coded in each matrix, and by construction, the 
determinant of each matrix is nonzero and lies in one of the discrete lattices above. 
The modulus of the determinant will therefore be bounded below by the length of 
the shortest vector in the lattice so the code will have the nonvanishing determinant 
property. 

Other Performance Measures 

In parallel, as the subject became better understood, several additional perfor- 
mance criteria started to be imposed on codes. In a fundamental paper [53], Zheng 
and Tse provided a precise quantification of the trade-off (known as the diversity- 
multiplexing gain or "DMG" tradeoff) between information rate and reliability. 
They defined numerical measures for each of the benefits, and showed that the pair 
of benefits lie in a region of the first quadrant whose upper boundary is a piecewise 
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linear concave up curve. In the paper [6] Vijay Kumar and his students showed 
that aU codes constructed from cychc division algebras with the additional nonva- 
nishing determinant property will automatically perform at the upper boundary of 
this region, and will hence be "DMG optimal." This of course further cemented 
the use of cyclic division algebras for code construction. 

Another set of criteria were proposed by Oggier and coworkers in the paper [16j . 
One first rewrites the matrix (l4|) as a single ri^ x 1 vector. When ki = ^ijf^j 
for Si J in the signal set and Pj an Of basis for Ok, this ri^ x 1 vector can be 
expressed as M.v, where M is an x matrix and v is the column vector 
(so,i, So, 2, • ■ • , Si,j, ■ ■ ■ , Sn-i,n)'^ ■ Onc now requires that the matrix M be unitary 
and that I7I = 1. The first condition is called "good shaping" and the idea behind it 
is that this forces the average energy needed to send the vector v without coding to 
be the same as that needed to send it in the coded matrix form Q . The condition 
I7I — 1 causes the average energy transmitted per antenna to be equal for all 
transmission. Oggier and coworkers called such codes "perfect" and constructed 
perfect codes for n = 2, 3, 4 and 6. This was followed by work of Elia and coworkers 
([?]) who constructed perfect codes for all values of n, and additionally, showed that 
perfect codes satisfy other information-theoretic properties such as information- 
losslessness and approximate universality. 

The mathematics needed for the work on perfect codes is quite interesting. An- 
alyzing the condition that M be unitary, we find that it is sufficient to make the 
following matrix unitary: 



C/({/3i,..., = 



f3i ■■■ f3n 



Here, it is not necessary that the Pj be an basis of Ok, it is sufficient that they 
be an Op linearly-independent subset of Ok- (So, for example, in the Golden Code 
([5]) above, a is chosen that with Pi = a and P2 = Oi4>, the matrix 

a acj} 


is unitary after being multiplied by the — p scale factor.) So the question is: how 

v5 

to find Op submodules of Ok that satisfy this unitary condition? For n = 2'', it is 
easy to see that for the field K = Q(C) and F = Q{i), where C is a primitive 2''+^-th 
root of unity, the various powers of C are Z[i]-linearly independent and satisfy the 
unitary condition above. For odd n, Elia and coworkers use a construction due to 
B. Erez ([8]) that was needed in a different context: Erez was showing that for 
certain cyclic extensions K/Q with Galois group G, the square-root of the inverse 
different is a free Z[G] module which has an orthogonal basis with respect to the 
usual trace form on K that sends x,y to TrK/qixy). 

The most recent performance criteria for space-time codes, and in some sense 
the most mathematically exciting, have come from Lahtonen and coworkers ([12j). 
For the usual cases where S is one of Z, Z[z], Z[a;], it is easy to see from the 
linearity of the code matrices X that on writing each X as an x 1 vector as 
above and separating the real and imaginary parts, one gets a full lattice in R , 
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i.e., the additive group generated by 2?!^ linearly independent vectors in M^" . 
We refer to this lattice as the code lattice. After normalizing all code matrices so 
that mtx£X \ det(X)| — 1, they postulate that codes whose lattice points are the 
most dense in R " will have the best performance, and indeed, they find this is 
borne out in several circumstances by simulations. To obtain a suitable numerical 
measure for the relative density, they invert the situation: they normalize the code 
lattice to have fundamental volume 1 instead. Thus, they define the normalized 
minimum determinant of a code lattice A of rank 2n^ in a Q(i) division algebra of 
index n (embedded in M„(C)) as the minimum of the modulii of the determinants 
I det(A(si, . . . , s„2))| as A(si, . . . , s„2) runs through the lattice, divided by the 
fundamental volume of A. Since a smaller fundamental volume represents a higher 
density, the goal is to construct codes whose code lattice A would maximize this 
ratio among all full lattices in the division algebra. 

Recall that if is a division algebra with center F and if i? is a subring of F 
whose quotient field is F, then an i?-order in _D is a subring T oi D containing R 
that is finitely generated as an i?-module and satisfies TF = D. A maximal i?-order 
is one that is maximal with respect to inclusion. In the typical situation where S 
is one of Z, Z[z], or so F is one of Q, Q(«), or (Q)(-\/— 3), and where the ki of 

the matrices in (|4]) are constrained to lie in Ok and 7 e Oi?, the code matrices 
of ([ll naturally form an S'-order. Thus the code matrices have a dual structure 
of an iS-order and a full lattice in M'^" . Lahtonen and coworkers investigate the 
interplay between these two structures. They ask: how will the code's performance 
as measured by its normalized minimum determinant vary if, in addition to carry- 
ing its natural structure of a full Z-lattice in R , we choose our code to form an 
arbitrary 5-order inside an i^-division algebra? In these cases, the minimum mod- 
ulus of the determinants of the code matrices is 1, so it follows from the definition 
of the normalized minimum determinant that the smaller the fundamental volume 
of the lattice the better the code. If Ti and T2 are ^-orders and A^i and Atj the 
corresponding lattices with fundamental volumes and , then Ti C T2 implies 
Ati C , which in turn means that 1% < Vt^ ■ It follows therefore that the best 
normalized minimum determinant will arise when a maximal order is used for the 
code. The authors then relate the fundamental volume of the code lattice to the 
Z-discriminant of the maximal order, and then invoke known formulas for discrim- 
inants of maximal orders to compute the best normalized minimum determinant of 
codes arising from Op orders inside a given division algebra. In particular, they 
show (for the fields Q(?), Q(\/~3) and Q) that the best division algebras to use 
will be ones that are ramified at precisely two of the "smallest" primes of the field 
(where the size of a prime P ~< tt > is defined to be the modulus |7r|). Thus, for 
Q(i) for example, one needs to transmit on a code arising from a maximal order 
inside a division algebra ramified only at (1 -I- 1) and (2 -I- 1) (or (2 — i)). (Much of 
this was part of Vehkalahti's Ph.D thesis.) 

One of the drawbacks of using maximal orders is that the corresponding code 
lattice may not have good shape. Thus, optimizing a code for minimum normalized 
determinant may destroy any optimization for shape. The recent work of Raj 
Kumar and Caire ([5]) proposes a very clever technique of mapping lattice points 
to certain cosets of a suitably chosen sublattice of a standard cubic lattice; this 
smooths out an irregular lattice and gives it better shape. In particular, their 
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technique applies to codes from lattices from maximal orders and provides a further 
performance boost in such cases. 

Key Challenge: Decoding 

What are some of the key problems that need to be solved in space-time codes? 
Perhaps the biggest engineering challenge in the subject is the issue of decoding. 
The problem quite simply is the following: given the received vectors in Y (see 
Equation H])), determine the entries of the matrix X that represent the original 
information. Assume that k symbols are coded in the matrix X and that the 
entries of X are linear in the signal entries si, . . . , Sk (typically arising from Z, 
Z[i], or 1j[uj]). By writing out sequentially the real and imaginary parts of each 
entry of Y, W, and si, . . . , Sk, we may rewrite Equation ([1]) as 1" = Zv + W. 
Here Z is an 2n^ x 2k real matrix that depends on H, 6, and the parameters of 
the code matrix X, v is the signal vector {xi,yi, . . . ,Xi,yi, . . . ,Xk,yk)'^ with Xi 
and yi being the real and imaginary parts of Si, and similarly for Y and W. If 
the columns of Z were orthonormal, decoding would be quite simple: we would 
have Z^Y = v + Z^W with Z^W also having independent, identically distributed 
Gaussian entries. Hence, under maximum likelihood estimation, v can be taken 
to be the closest vector in (viewed inside the Euclidean space R^*^) to Z^Y . 
This is a very simple and computationally fast scheme: we march through Z^Y 
component pair by component pair and we find the clement of the signal lattice S 
closest to that component pair. 

The process above is called single symbol decoding. (For k < this is the same as 
orthogonal projection on to the subspace of K.^" determined by the columns of Z.) 
There are some nice situations where the matrix Z is (essentially) orthogonal: this 
happens in the case of the Alamouti code, and more generally, in the codes satisfying 
Equation The matrix Z for such codes satisfies ZZ'^ = 9'^Tr{HW)l2k- We 
may divide the relation Y ~ Zv + W by 9^jTr{HH^). The entries of the new noise 
vector l/{9^Tr{HH^))W are still independent identically distributed Gaussian, 
while the columns of the matrix l/(9^Tr{HH^))Z are now orthonormal. Thus 
single symbol decoding can be employed in all these cases. 

But for other codes Z is rarely orthogonal! In general, given that the entries 
of W are independent identically distributed Gaussian, for maximum likelihood 
estimation one needs to search in 5" (viewed inside the Euclidean space K^" ) 
for that vector v — (xi, yi, . . . , a;„2, y„2)-^ such that Zv is closest to Y. (Here 
we will assume that fc = n^, as is usually the case for codes from cyclic division 
algebras.) This can no longer be accomplished symbol by symbol, and one needs 

2 

to search in the full space S"" instead of just in 5*. There is an algorithm called 
the sphere decoding algorithm (see ^ for instance) that accomplishes this search 
in an intelligent manner, but as is to be expected of any search in 5" , even this 
algorithm gets very cumbersome once n exceeds 2. (However, in [TT], Hassibi and 
Vikalo show that under certain technical assumptions, the expected complexity of 
the sphere decoding algorithm is polynomial, although the worst case complexity 
is exponential.) 

Since Z is rarely orthogonal, we may ask whether we can take advantage of 
the obvious algebraic structure of the code and simplify the closest vector problem 
for our particular application. A very clever set of ideas of Lizzi et. al. (|15)) 
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does just that, and gives an approximate solution to the decoding problem for the 
Golden Code (Equation [5]) by reducing the situation to the action of SL2{C) on 
three dimensional hyperbolic space H"^. Their work is a veritable tour-de- force of 
the application of abstract mathematics to engineering problems. Their goal is 
to approximate the channel matrix H (normalized to have determinant 1) by an 
element U of determinant 1 in the Z[i] -order R = (Oa'/^Wj '^i Writing H — EU 
with E simply being the error HU^^, they argue that choosing U so that the 
Frobenius norm ofi?^^ = UH^^ is minimized approximates the original problem 
by the following: given a vector Y in C" and an unknown vector S in Z[z]" 
determine a "best" estimate of S if the difference vector W = Y — S is known to be 
approximately independent identically distributed Gaussian (in a suitable sense). 
Given this assumption about the the noise vector W , a reasonable way to proceed 
is to assume that W is actually independent identically distributed Gaussian. In 
this situation, the maximum-likelihood estimate of S is obtained by taking the i-th 
entry of S to be the lattice point in Z[z] closest to the i-th entry of Y. The authors 
find that their scheme gives a fast and acceptably accurate decoding. 

What is fascinating is the mathematics behind their choice of U. First, they 
need to determine generators and relations for the group of norm 1 units Ui{R) 
of R (i.e, the set of multiplicatively invertible elements of R whose determinant 
as a code matrix is 1). In general, it is very difficult to find these for orders 
in division algebras, but in the case of certain special quaternion algebras over 
number fields, generators and relations for Ui{R) is known. Much of the ideas 
behind this goes back to Poincare. The norm 1 units in the order R above (modulo 
the subgroup {±1}) turns out to be a Kleinian group, i.e., a discrete subgroup of 
the projective special linear group PSL2{C). As a subgroup of PSL2{C), Ui{R) 
(modulo {±1}) acts on the upper-half space model of hyperbolic 3-space as a 
group of orientation-preserving isometrics, and Poincare's Fundamental Polyhedron 
Theorem gives a set of generators and relations for such a group in terms of certain 
automorphisms of a fundamental domain for the group. Given a point P in H'', the 
Dirichlet polyhedron centered on P is the closure of the set of points x such that 
dH{x,P) < dnigix), P) for all g G Ui{R) (modulo {±1}), ^ 1, where dn is the 
hyperbolic metric on H^. The authors construct a Dirichlet polyhedron centered on 
J = (0, 0, 1); this is a fundamental domain for Ui{R). From this polyhedron, using 
Poincare's theorem and a computer search, they determine a set of generators of 
Ui{R). They do this ahead of time, and store the results. Next, in real time, given a 
fading matrix H (normalized to have determinant 1), they need to find an element 
U oi Ui{R) such that the Frobenius norm of UH^^ is minimized. They observe 
that viewing UH^^ as an element of PSL2{€.) acting on H'^, the Frobenius norm 
of UH~^ is just 2 cosh dniJ, UH~^{J)), where J and dn are as above. Since U is 
an isometry, they must find U G Ui{R) that minimizes cosh dH{U~^{J), H~^(J)). 
From the definition of Dirichlet polyhedra, this means that they need to find a 
Dirichlet polyhedron centered on some U~^{J) which contains H~^{J). They use 
the geometry of relative to the action of Ui{R) to find such a U: they just 
need to repeatedly consider the various Dirichlet polyhedra centered on J and the 
various gi{J), where the gi run through the generators oilAi{R) that they have 
computed ahead of time, along with their inverses. 
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Role of Mathematicians 

What is the role of mathematicians in this field? The subject is clearly very 
mathematical; yet, unlike classical coding theory which now has a mathematical 
life of its own and can, for instance, be thought of as a theory of subspaces of 
vector spaces over finite fields, the center of gravity of space-time codes currently 
lies very solidly in engineering. There is as yet no deep independent "mathematics 
of space-time codes" : the driving force behind the subject consists of fundamental 
engineering problems that need to be solved before MIMO wireless communica- 
tion reaches its full practical potential, particularly for three or more antennas. 
This author therefore believes that, as things stand now, isolated mathematical 
investigations of space-time codes that are not grounded in concrete engineering 
questions would very likely lead to sterile results. At least for now, mathematicians 
can best contribute to the subject by working in collaboration with engineers who 
are motivated by fundamental engineering questions. This author has found that 
the leading engineers in the field have a practical and intuitive understanding of 
much abstract mathematics, but welcome help from trained mathematicians. (This 
author has also found that they are a genuine pleasure to collaborate with.) There 
is clearly a lot of work for mathematicians to do: particularly in decoding systems 
with large numbers of receive and transmit antennas, but also in other areas of 
MIMO communication that we have not touched upon in this article, such as co- 
operative communication in networks, or noncoherent communication, where the 
matrix H is not known to either the receiver or the transmitter. 
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